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Diagonal matrix 

From Wikipedia, the free encyclopedia 

In linear algebra, a diagonal matrix is a square matrix in which the entries outside the main diagonal (\) are 
all zero. The diagonal entries themselves may or may not be zero. Thus, the matrix D = (dj j) with n columns 
and n rows is diagonal if: 

dij=0^i^ j Vi,i € {l,2,...,n}. 

For example, the following matrix is diagonal: 

"1 0 0 " 
0 4 0. 
0 0 -3 

The term diagonal matrix may sometimes refer to a rectangular diagonal matrix, which is an m-hy-n matrix 
with only the entries of the form i possibly non-zero; for example, 

1 0 0 0 0" 
0 4 0 0 0 . 
0 0 -3 0 0 

» « 

However, in the remainder of this article we will consider only square matrices. 

Any diagonal matrix is also a symmetric matrix. Also, if the entries come from the field R or C, then it is a 
normal matrix as well. 

Equivalently , we can define a diagonal matrix as a matrix that is both upper- and lower- triangular. 

The identity matrix /„ and any square zero matrix are diagonal. A one -dimensional matrix is always diagonal. 



Contents 

■ 1 Scalar matrix 

■ 2 Matrix operations 

■ 3 Other properties 

■ 4 Uses 

■ 5 Operator theory 



1 0 0 

0 4 0 

0 0 --3 ' 

0 0 0 



http://en.wiklpedia.org/wiki/Diagonal_matrix 



Page 1 of 4 



Diagonal matrix - Wilcipedia, the free encyclopedia 



3/16/10 10:08 AIM 



■ 6 See also 

■ 7 References 



Scalar matrix 

A diagonal matrix with all its main diagonal entries equal is a scalar matrix, that is, a scalar multiple U of the 
identity matrix /. Its effect on a vector is scalar multiplication by X. For example, a 3x3 scalar matrix has the 
form: 



The scalar matrices are the center of the algebra of matrices: that is, they are precisely the matrices that 
commute with all other square matrices of the same size. 

For an abstract vector space V (rather than the concrete vector space K^), or more generally a module M over 
a ring R, with the endomorphism algebra End(M) (algebra of linear operators on M) replacing the algebra of 
matrices, the analog of scalar matrices are scalar transformations. Formally, scalar multiplication is a linear 
map, inducing a map R —4 Eind(M), (send a scalar X to the corresponding scalar transformation, 
multiplication by X) exhibiting End(A/) as a /?-algebra. For vector spaces, or more generally free modules 
M — R^'^^^ which the endomorphism algebra is isomorphic to a matrix algebra, the scalar transforms are 
exactly the center of the endomorphism algebra, and similarly invertible transforms are the center of the 
general linear group GL(V), where they are denoted by Z( V), follow the usual notation for the center. 

Matrix operations 

The operations of matrix addition and matrix multiplication are especially simple for diagonal matrices. Write 
diag(<2i for a diagonal matrix whose diagonal entries starting in the upper left corner are aj Then, 
for addition, we have 

diag(ai,...,fl„) + diag(f»i = diag(ai+^l,...,a„+Z7„) 

and for matrix multiplication, 

diag(ai,...,a„) • diag(^i,...,i>^) = diag(ai^>i,...,a„Z?„). 

The diagonal matrix diag(ai,...,a„) is invertible if and only if the entries are all non-zero. In this case, 

we have 

diag(ai,...,a„)"^ = diag(ai"^...,a^-l). 
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In particular, the diagonal matrices form a subring of the ring of all n-by-n matrices. 

Multiplying an n-hy-n matrix A from the left with diagCfl^ ,...,«„) amounts to multiplying the i-th row of A by 
for all i; multiplying the matrix A from the right with diag(ai ,...,a„) amounts to multiplying the i-th column of 
A by ttf for all i. 

Other properties 

The eigenvalues of diagCa^ , a„) are ai , a„ with associated eigenvectors of e i , where the vector ei 
is all zeros except a one in the ith row. The determinant of diagCaj , a„) is the product ...a„. 

The adjugate of a diagonal matrix is again diagonal. 

A square matrix is diagonal if and only if it is triangular and normal. 

Uses 

Diagonal matrices occur in many areas of linear algebra. Because of the simple description of the matrix 
operation and eigenvalues/eigenvectors given above, it is always desirable to represent a given matrix or linear 
map by a diagonal matrix. 

In fact, a given n-hy-n matrix A is similar to a diagonal matrix (meaning that there is a matrix X such thatZ' 

^AX is diagonal) if and only if it has n linearly independent eigenvectors. Such matrices are said to be 
diagonalizable. 

Over the field of real or complex numbers, more is true. The spectral theorem says that every normal matrix is 
unitarily similar to a diagonal matrix (if AA* = A* A then there exists a unitary matrix C/ such that UAU* is 
diagonal). Furthermore, the singular value decomposition implies that for any matrix A, there exist unitary 
matrices U and Vsuch that UAV* is diagonal with positive entries. 

Operator theory 

In operator theory, particularly the study of PDEs, operators are particularly easy to understand, and PDEs 
easy to solve, if the operator is diagonal with respect to the basis one is working with - this corresponds to a 
separable partial differential equation. Thus, a key technique to understand operators is to have a change of 
coordinates - in the language of operators, an integral transform - which changes the basis to an eigenbasis of 
eigenf unctions: which makes the equation separable. An important example of this is the Fourier transform, 
which diagonalizes the heat equation. 

See also 

■ Anti -diagonal matrix 

■ Banded matrix 
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■ Bidiagonal matrix 

■ Diagonally dominant matrix 

■ Diagonalizable matrix 

■ Tridiagonal matrix 

■ Toeplitz matrix 

■ Toral Lie algebra 

■ Circulant matrix 

References 

■ Roger A. Horn and Charles R. Johnson, Matrix Analysis, Cambridge University Press, 1985. ISBN 0- 
521-30586-1 (hardback), ISBN 0-521-38632-2 (paperback). 

Retrieved from "http://en.wikipedia.org/wiki/Diagonal_matrix" 

Categories: Matrices I Matrix normal forms 

■ This page was last modified on 9 March 2010 at 21 :36. 

■ Text is available under the Creative Commons Attribution -ShareAlike License; additional terms may 
apply. See Terms of Use for details. 

Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. 



http://en.wikipedia.org/wiki/Diagonal_matrix 



Page 4 of 4 



Rotation matrix - Wikipedia, the free encyclopedia 



3/16/10 10:08 AM 



Rotation matrix 



From Wikipedia, the free encyclopedia 

In linear algebra, a rotation matrix is any matrix that acts as a rotation in Euclidean space. For example, the 
matrix 

cos^ — sin^ 
sinJ cos6l 

rotates vectors in the xy-plane counterclockwise by an angle of 6. In three dimensions, rotation matrices are among 
the simplest algebraic descriptions of rotations, and are used extensively for computations in geometry, physics, and 
computer graphics. 

Though most applications involve rotations in 2 or 3 dimensions, rotation matrices can be defined for n-dimensional 
space. Algebraically, a rotation matrix is an orthogonal matrix whose determinant is equal to 1: 



Rotation matrices are always square, and are usually assumed to have real entries, though the definition makes 
sense for other scalar fields. The set of all n x « rotation matrices forms a group, known as the rotation group (or 
special orthogonal group). 
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Rotations in two and three dimensions 

In all of this section, the matrices are assumed to act on column vectors, for instance in cartesian coordinates 
systems: [x,y,zj in 3D or [x,y] in 2D (each of them transposed to a column vector). 

Dimension two 

In two dimensions, every rotation matrix has the following form: 



K(9) = 



COS 0 — sin B 
sin 9 cos 6 



(rotation hy 6) . 



This matrix rotates the plane around the origin by an angle of 6. The x axis is rotated towards the y axis. 
The new coordinates (x',y') for a point (x,y) will then be given by: 

X = X cos Q — y sin B 
y =x smB + ycos B 



In an oriented plane 
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If we use the standard right-handed coordinate system, where x axis goes to 
the right and where y axis goes up, the rotation is counterclockwise. If one 
uses the opposite convention, for example x directed to the right and y 
directed to the bottom, the rotation will be clockwise. To get convinced that 
it is still the same rotation, one can think at the plane as a sheet of paper 
being watched alternatively from above and from beneath, by transparency. 

Such non-standard orientations are almost never used in mathematics and 
physics, but they are very common in computer graphics f^^, because they 
match the direction of writing for Western scripts: from the left to the right 
and from the top to the bottom. That's the reason why, in much computer 
software, rotations go clockwise. 

Assuming the standard orientation, for a clockwise rotation, simply replace 0 
hy-O: 




The effect of the rotation matrix in a 
plane oriented in the standard way 



m = 



R(-B) = 




The effect of the same rotation 
matrix in a plane that is oriented 
otherwise 



(coimterclockwise rotation by 0) . 



(clockwise rotation by 0) . 



Common rotations 



Particularly useful are the matrices for 90° and 180" rotations: 
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0 -1 

1 0 



--1 0 
0 -1 

0 1 



(90^ counterclockwise rotation). 



(180° rotation). 



(90*^ clockwise rotation) . 



Dimension three 

See also: Rotation representation 
Basic rotations 

There are three basic rotation matrices in three dimensions: 



R,{0) = 



RM = 



1 0 0 
0 COS 5 — sin^ 

0 sin ^ cos^ 

cos^ 0 sin^ 
0 1 0 

— sin^ 0 cos^ 

cos 9 — sin ^ 0 
sin J cos^ 0 
0 0 1 



These matrices represent counterclockwise rotations of an object relative to fixed coordinate axes, by an angle of 0. 
The direction of the rotation is as follows: R^. rotates the y-axis towards the z-axis, Ry rotates the z-axis towards the 
X-axis, and R^ rotates the x-axis towards the y-axis. 

The resulting coordinates (x',y',z') for a point (x,y,z), for each of these rotations are: 





x' 




z' 


Rx(0) 


X 


ycosd — -2sinj9 


z cosd + ysm d 


Ry(e) 


xcosO + zsmd 


y 


—xsm$-\-z cos $ 
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In an oriented space 

If the 3D space is oriented in the usual way, i.e. x going to the right, y going to the front and z going up, these three 
rotations are counterclockwise when the third (unchanged) axis goes towards the observer. This direction of the 
rotation can be determined by the right-hand rule. 

General rotations 

Other rotation matrices can be obtained from these three using matrix multiplication. For example, the product 
represents a rotation whose yaw, pitch, and roll are a,yS, and y, respectively. Similarly, the product 

RM^M^M 

represents a rotation whose Euler angles are a,P, and y (using the z-x-z convention for Euler angles). 
Finding the rotation matrix 

Every rotation in three dimensions is defined by its axis — a direction that is left fixed by the rotation — and its 
an^le — the amount by which the rotation turns. 

Determining the axis 

Given a rotation matrix R, a vector u parallel to the rotation axis must satisfy 

Rm = u 

since the rotation of u around the rotation axis must result in u- The 
equation above may be solved for u which is unique up to a scalar factor. 

Further, the equation may be rewritten 

R\i = lu [R— /)u = 0 

which shows that u is the null space of R — I. Viewed another way, u is an 

eigenvector corresponding to the eigenvalue A, = 1 (every rotation matrix 
must have this eigenvalue). 

Determining the angle 

To find the angle of a rotation, once the axis of the rotation is known, select 
a vector v perpendicular to the axis. Then the angle of the rotation is the 




I " 

! A rotation R around axis u can be 

decomposed using 3 endomorphisms { 

I P, (I - P), and Q (click to enlarge). I 
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Rotation matrix given an axis and an angle 

For some applications, it is helpful to be able to make a rotation with a given axis. Given a unit vector 
u = (Uj^, Uy, u^), where u^^ + + = 1 , the matrix for a rotation by an angle of Q about an axis in tl 
ofuisf^^: 
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R 



Us{{ly{\ — c) + U^E li^ + (1 — 
U^U^iX — c) — ll^S t£.glts(l — c) 4- W-ajS 



w?. + (1 — ti^)c 



where 



This can be written more concisely as 

= u ® u + €os^( J — u (g) u) + sin^[u]x , 
where [u] ^ is the skew symmetric form of u, and 0 is the outer product. 

If the 3D space is oriented in the usual way, this rotation will be counterclockwise for an observer placed so that the 
axis u goes in his or her direction (Right-hand rule). 

Simpler form of the axis-angle formula 

Rodrigues' rotation formula can be written as 



/? = p + (/ _ p)cos^ + Q sinil 



where 



p = 



U 



T 

= UU , 



/ = 



1 0 0 
0 10 
0 0 1 



Q = 



0 




% 




0 








0 



The matrix / is the 3 x 3 identity matrix. The matrix Q is the skew- symmetric representation of a cross product with 
u. The matrix P is the projection onto the axis of rotation, and / - P is the projection onto the plane orthogonal to 
the axis. 

Properties of a rotation matrix 

The above discussion can be generalised to any number of dimensions. For any rotation matrix Ra^-ig g € 
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/, the identity in / g 

ff^ — R"^ 
det ) = 1 

Examples 



The 2x2 rotation matrix 



The 3x3 matrix 



Q 



0 -1 

1 0 



corresponds to a 90° planar rotation. 
The transpose of the 2x2 matrix 



M 



0.936 
0.352 



0.352 
-0.936 



is its inverse, but since its determinant is -1 this is 
not a rotation matrix; it is a reflection across the 
line II3; = 2jc. 
The 3x3 rotation matrix 



1 
0 
0 



0 

2 

2 



0' 

i 

s 

2 • 



corresponds to a -30° rotation around the x axis in 
three-dimensional space. 
The 3x3 rotation matrix 



Q = 



0.36 0.48 -0.8 
-0.8 0.60 0 
0.48 0.64 0.60 



corresponds to a rotation of approximately 74° 
around the axis (-^/s^/^^/s) in three-dimensional 
space. 

The 3x3 permutation matrix 

To 0 il 



M = 



3 -4 
5 3 

--9 2 



6 



has determinant +1, but its transpose is not its 
inverse, so it is not a rotation matrix. 
The 4x3 matrix 



M = 



0.5 
0.1 
-"0.7 
"^0.5 



-0.1 
0.5 
0.5 

•^0.7 



0.7 
-0.5 

0.5 
■"0.1 



is not square, and so cannot be a rotation matrix; 
yet M^M yields a 3x3 identity matrix (the columns 
are orthonormal). 
The 4x4 matrix 



Q = 



-1 


0 


0 


0 


0 


-1 


0 


0 


0 


0 


_1 


0 


0 


0 


0 





1 



describes an isoclinic rotation, a rotation through 
equal angles (180°) through two orthogonal planes. 
The 5x5 rotation matrix 



Q = 



0 


__1 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


-1 


0 


0 


n 


n 


n 


_i 


n 
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P = 



1 0 0 
0 1 0 



\ 



is a rotation matrix, as is the matrix of any even 
permutation, and rotates through 120° about the 
axis x = y = z. 



0 0 0 0 1 



rotates vectors in the plane of the first two 
coordinate axes 90°, rotates vectors in the plane of 
the next two axes 180°, and leaves the last 
coordinate axis unmoved. 



Geometry 

In Euclidean geometry , a rotation is an example of an isometry, a transformation that moves points without 
changing the distances between them. Rotations are distinguished from other isometrics by two additional 
properties: they leave (at least) one point fixed, and they leave "handedness" unchanged. By contrast, a translation 
moves every point, a reflection exchanges left- and right-handed ordering, and a glide reflection does both. 

A rotation that does not leave "handedness" unchanged is called an Improper Rotation or a Rotoinversion 

If we take the fixed point as the origin of a Cartesian coordinate system, then every point can be given coordinates 
as a displacement from the origin. Thus we may work with the vector space of displacements instead of the points 
themselves. Now suppose {p^ ,. . . are the coordinates of the vector p from the origin, O, to point P. Choose an 
orthonormal basis for our coordinates; then the squared distance to P, by Pythagoras, is 



£{o,P) = Mf = '^p.. 



n 



'J 



r=l 



which we can compute using the matrix multiplication 



IIpIP = [Vl'"Pn] 



T 

= P P 



A geometric rotation transforms lines to lines, and preserves ratios of distances between points. From these 
properties we can show that a rotation is a linear transformation of the vectors, and thus can be written in matrix 
form, 6p. The fact that a. rotation preserves, not just ratios, but distances themselves, we can state as 

P^'p = {Qpf[Q9), 



or 



P^/p = (p^Q^){Qp) 
= P^ 



■^(Q^Q)P. 
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Because tJiis equation holds tor all vectors, p, we conclude that every rotation matrix, Q, satisfies the orthogonality 
condition, 

Q^Q = I' 

Rotations preserve handedness because they cannot change the ordering of the axes, which implies the special 
matrix condition, 

detQ = +l. 

Equally important, we can show that any matrix satisfying these two conditions acts as a rotation. 

Multiplication 

The inverse of a rotation matrix is its transpose, which is also a rotation matrix: 

detQ^ =detQ = +L 
The product of two rotation matrices is a rotation matrix: 

{QiQ2f(QiQ2) = Ql(Q\Qi)Q2 = I 

det(QiQ2) = (detQi)(detQ2) = +1. 
For n greater than 2, multiplication of nxn rotation matrices is not commutative. 





0 


^1 0 






0 0 


1 


Qi = 


1 


0 0 






0 1 


0 




0 


0 1 






_i 0 


0 




'0 


"1 


0" 




"0 0 1" 




Q1Q2 = 


0 


0 


1 


Q2Q1 = 


1 0 0 








0 


0 




0 1 0 





Noting that any identity matrix is a rotation matrix, and that matrix multiplication is associative, we may summarize 
all these properties by saying that the nxn rotation matrices form a group, which for n > 2 is non-abelian. Called a 
special orthogonal group, and denoted by SO(n), SO(«JR), S0„, or SO„(R), the group of nxn rotation matrices is 
isomorphic to the group of rotations in an n-dimensional space. This means that multiplication of rotation matrices 
corresponds to composition of rotations, applied in left-to-right order of their corresponding matrices. 



Ambiguities 

The interpretation of a rotation matrix can be subject to =■=-■ - =.=.= ■= - - - ■ -=rj 

many ambiguities. ^^^^^X*'''*'''"^^ ^ /'^'''''''nr?!^^ ^. 
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Alias or alibi transformation 

The change in a vector's coordinates can indicate a 

turn of the coordinate system (alias) or a turn of 

the vector (alibi). 
Right- or left-handed coordinates 

The matrix can be with respect to a right-handed 

or left-handed coordinate system. 
World or body axes 

The coordinate axes can be fixed or rotate with a 

body. 
Vectors or forms 

The vector space has a dual space of linear forms, 

and the matrix can act on either vectors or forms. 



In most cases the effect of the ambiguity is to transpose or invert the matrix. 

Decompositions 

Independent planes 

Consider the 3x3 rotation matrix 

0.36 0.48 -0.8" 
-0.8 0.60 0 . 
0.48 0.64 0.60_ 

If Q acts in a certain direction, v, purely as a scaling by a factor X, then we have 

Qv = Av, 

so that 

0 = (A/ - Q)v. 
Thus X is a root of the characteristic polynomial for Q, 

0 = det(A/ ~ Q) 

- A 25 A + ^A ™ 1 

= (A^1)(A^^MA + 1). 

Two features are noteworthy. First, one of the roots (or eigenvalues) is 1, which tells us that some direction is 
unaffected by the matrix. For rotations in three dimensions, this is the cais of the rotation (a concept that has no 
meaning in any other dimension). Second, the other two roots are a pair of complex conjugates, whose product is 1 
(the constant term of the quadratic), and whose sum is 2 cos G (the negated linear term). This factorization is of 
interest for 3x3 rotation matrices because the same thing occurs for all of them. (As special cases, for a null rotation 
the "complex conjugates" are both 1 , and for a 180° rotation they are both -1 .) Furthermore, a similar factorization 




Alias and alibi rotations 
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holds for any nxn rotation matrix. If the dimension, n, is odd, there will be a "dangUng" eigenvalue of 1; and for 
any dimension the rest of the polynomial factors into quadratic terms like the one here (with the two special cases 
noted). We are guaranteed that the characteristic polynomial will have degree n and thus n eigenvalues. And since a 
rotation matrix commutes with its transpose, it is a normal matrix, so can be diagonalized. We conclude that every 
rotation matrix, when expressed in a suitable coordinate system, partitions into independent rotations of two- 
dimensional subspaces, at most "/2 of them. 

The sum of the entries on the main diagonal of a matrix is called the trace; it does not change if we reorient the 
coordinate system, and always equals the sum of the eigenvalues. This has the convenient implication for 2x2 and 
3x3 rotation matrices that the trace reveals the angle of rotation, 0, in the two-dimensional (sub-)space. For a 2x2 
matrix the trace is 2 cos(e), and for a 3x3 matrix it is 1+2 cos(0). In the three-dimensional case, the subspace 
consists of all vectors perpendicular to the rotation axis (the invariant direction, with eigenvalue 1). Thus we can 
extract from any 3x3 rotation matrix a rotation axis and an angle, and these completely determine the rotation. 

Sequential angles 

The constraints on a 2x2 rotation matrix imply that it must have the form 



with a^+b^ = 1 . Therefore we may set a = cos 0 and Z> = sin 0, for some angle 0. To solve for 0 it is not enough to 
look at a alone or b alone; we must consider both together to place the angle in the correct quadrant, using a two- 
argument arctangent function. 

Now consider the first column of a 3x3 rotation matrix. 



Although a^+iP- will probably not equal 1 , but some value r'^ < 1 , we can use a slight variation of the previous 
computation to find a so-called Givens rotation that transforms the column to 



zeroing b. This acts on the subspace spanned by the x and y axes. We can then repeat the process for the xz 
subspace to zero c. Acting on the full matrix, these two rotations produce the schematic form 



a 



b 



c 



0 



c 



1 

0 

0 



0 



0 




* 



* 
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Shifting attention to the second column, a Givens rotation of the yz subspace can now zero the z value. This brings 
the full matrix to the form 



which is an identity matrix. Thus we have decomposed Q as 
An nxn rotation matrix will have (n-l)+(n-2)+- • +2+1 , or 

entries below the diagonal to zero. We can zero them by extending the same idea of stepping through the columns 
with a series of rotations in a fixed sequence of planes. We conclude that the set of nxn rotation matrices, each of 

which has rP" entries, can be parameterized by n{n-\)l2 angles. 



In three dimensions this restates in matrix form an observation made by Euler, so 
mathematicians call the ordered sequence of three angles Euler angles. However, the 
situation is somewhat more complicated than we have so far indicated. Despite the 
small dimension, we actually have considerable freedom in the sequence of axis pairs 
we use; and we also have some freedom in the choice of angles. Thus we find many 
different conventions employed when three-dimensional rotations are parameterized for 
physics, or medicine, or chemistry, or other disciplines. When we include the option of 
world axes or body axes, 24 different sequences are possible. And while some 
disciplines call any sequence Euler angles, others give different names (Euler, Cardano, 
Tait-Bryan, roll -pitch-yaw) to different sequences. 





xzyy, 




xyzy, 


yxyy, 


yxzv, 


yzy^v 




zyz^ 


zyxy, 








yzxb 


xyx^ 


zyx\y 


yxyh 


zxyh 


yzyh 


xzy^ 


zyz\y 


xyzb 




yxz\, 



One reason for the large number of options is that, as noted previously, rotations in three dimensions (and higher) 
do not commute. If we reverse a given sequence of rotations, we get a different outcome. This also implies that we 
cannot compose two rotations by adding their corresponding angles. Thus Euler angles are not vectors, despite a 
similarity in appearance as a triple of numbers. 

Nested dimensions 

A 3x3 rotation matrix like 



suggests a 2x2 rotation matrix, 

is embedded in the upper left comer: 

This is no illusion; not just one, but many, copies of n-dimensional rotations are found within (n+l)-dimensional 
rotations, as subgroups. Each embedding leaves one direction fixed, which in the case of 3x3 matrices is the 
rotation axis. For example, we have 
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fixing the x axis, the 3; axis, and the z axis, respectively. The rotation axis need not be a coordinate axis; if u = 
(x,yz) is a unit vector in the desired direction, then 

where cq = cos 0, sq = sin 0, is a rotation by angle 0 leaving axis u fixed. 

A direction in (n+1) -dimensional space will be a unit magnitude vector, which we may consider a point on a 

generalized sphere, S". Thus it is natural to describe the rotation group S0(n+1) as combining SO(n) and 5". A 
suitable formalism is the fiber bundle, 

where for every direction in the "base space", S", the "fiber" over it in the "total space", SO(n+l), is a copy of the 
"fiber space", SO(n), namely the rotations that keep that direction fixed. 

Thus we can build an nxn rotation matrix by starting with a 2x2 matrix, aiming its fixed axis on (the ordinary 
sphere in three-dimensional space), aiming the resulting rotation on S^, and so on up through S"~^.A point on S" 
can be selected using n numbers, so we again have n(n-l)/2 numbers to describe any nxn rotation matrix. 

In fact, we can view the sequential angle decomposition, discussed previously, as reversing this process. The 
composition of n-1 Givens rotations brings the first column (and row) to (1 ,0,. . . ,0), so that the remainder of the 
matrix is a rotation matrix of dimension one less, embedded so as to leave (1,0,..., 0) fixed. 

Skew parameters via Cayley's formula 

When an nxn rotation matrix, Q, does not include -1 as an eigenvalue, so that none of the planar rotations of 
which it is composed are 180° rotations, then Q+I is an invertible matrix. Most rotation matrices fit this description, 

and for them we can show that (<2-/)(<2+/)~^ is a skew- symmetric matrix, A. Thus = -A; and since the diagonal 
is necessarily zero, and since the upper triangle determines the lower one, A contains n(n-l)/2 independent 
numbers. Conveniently, I-A is invertible whenever A is skew- symmetric; thus we can recover the original matrix 
using the Cay ley transform, 

which maps any skew- symmetric matrix A to a rotation matrix. In fact, aside from the noted exceptions, we can 
produce any rotation matrix in this way. Although in practical applications we can hardly afford to ignore 180° 
rotations, the Cayley transform is still a potentially useful tool, giving a parameterization of most rotation matrices 
without trigonometric functions. 

In three dimensions, for example, we have (Cayley 1846) 

If we condense the skew entries into a vector, (x,y^), then we produce a 90° rotation around the x axis for (1 ,0,0), 
around the y axis for (0,1 ,0), and around the z axis for (0,0,1). The 180° rotations are just out of reach; for, in the 
limit as x goes to infinity, (x,0,0) does approach a 180° rotation around the x axis, and similarly for other 
directions. 
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Lie theory 

Lie group 

We have established that nxn rotation matrices form a group, the special orthogonal group, SO(n). This algebraic 
structure is coupled with a topological structure, in that the operations of multiplication and taking the inverse 
(which here is merely transposition) are continuous functions of the matrix entries. Thus SO(n) is a classic example 
of a topological group. (In purely topological terms, it is a compact manifold.) Furthermore, the operations are not 
only continuous, but smooth, so SO(n) is a differentiable manifold and a Lie group (Baker (2003); Fulton & Harris 
(1991)). 

Most properties of rotation matrices depend very little on the dimension, n; yet in Lie group theory we see 
systematic differences between even dimensions and odd dimensions. As well, there are some irregularities below 
n = 5; for example, S0(4) is, anomalously, not a simple Lie group, but instead isomorphic to the product of and 
SO(3). 

Lie algebra 

Associated with every Lie group is a Lie algebra, a linear space equipped with a bilinear alternating product called 
a bracket. The algebra for SO(n) is denoted by 

and consists of all skew- symmetric nxn matrices (as implied by differentiating the orthogonality condition, / = 
Q^Q). The bracket, [Ai ^al* of two skew-symmetric matrices is defined to be A1A2-A2A1 , which is again a skew- 
symmetric matrix. This Lie algebra bracket captures the essence of the Lie group product via infinitesimals. 

For 2x2 rotation matrices, the Lie algebra is a one-dimensional vector space, multiples of 

Here the bracket always vanishes, which tells us that, in two dimensions, rotations commute. Not so in any higher 
dimension. For 3x3 rotation matrices, we have a three-dimensional vector space with the convenient basis 

The Lie brackets of these generators are as follows 

We can conveniently identify any matrix in this Lie algebra with a vector in R^, 

Under this identification, the so(3) bracket has a memorable description; it is the vector cross product, 

The matrix identified with a vector v is also memorable, because 

Notice this implies that v is in the null space of the skew-symmetric matrix with which it is identified, because vxv 
is always the zero vector. 
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Exponential map 

Connecting the Lie algebra to the Lie group is the exponential map, which we define using the familiar power 
series for e^ (Wedderbum 1934, §8.02), 

For any skew-symmetric A, exp(A) is always a rotation matrix. 

An important practical example is the 3x3 case, where we have seen we can identify every skew- symmetric matrix 
with a vector w = u9, where u = (x,y,z) is a unit magnitude vector. Recall that u is in the null space of the matrix 
associated with w, so that if we use a basis with u as the z axis the final column and row will be zero. Thus we 
know in advance that the exponential matrix must leave u fixed. It is mathematically impossible to supply a 
straightforward formula for such a basis as a function of u (its existence would violate the hairy ball theorem), but 
direct exponentiation is possible, and yields 

where c = cos 5 = sin We recognize this as our matrix for a rotation around axis u by angle 9. We also note 
that this mapping of skew- symmetric matrices is quite different from the Cayley transform discussed earlier. 

In any dimension, if we choose some nonzero A and consider all its scalar multiples, exponentiation yields rotation 
matrices along a geodesic of the group manifold, forming a one-parameter subgroup of the Lie group. More 
broadly, the exponential map provides a homeomorphism between a neighborhood of the origin in the Lie algebra 
and a neighborhood of the identity in the Lie group. In fact, we can produce any rotation matrix as the exponential 
of some skew- symmetric matrix, so for these groups the exponential map is a surjection. 

Baker-Campbell-Hausdorff formula 

Suppose we are given A and B in the Lie algebra. Their exponentials, exp(A) and exp(5), are rotation matrices, 
which we can multiply. Since the exponential map is a surjection, we know that for some C in the Lie algebra, 
exp(A)exp(B) = exp(C), and we write 

When exp(A) and exp(5) commute (which always happens for 2x2 matrices, but not higher), then C = A+B, 
mimicking the behavior of complex exponentiation. The general case is given by the BCH formula, a series 
expanded in terms of the bracket (Hall 2004, Ch. 3; Varadarajan 1984, §2.15). For matrices, the bracket is the same 
operation as the commutator, which detects lack of commutativity in multiplication. The general formula begins as 
follows. 

Representation of a rotation matrix as a sequential angle decomposition, as in Euler angles, may tempt us to treat 
rotations as a vector space, but the higher order terms in the BCH formula reveal that to be a mistake. 

We again take special interest in the 3x3 case, where [AJB] equals the cross product, AxB. If A and B are linearly 
independent, then A, B, and AxB can be used as a basis; if not, then A and B commute. And conveniently, in this 
dimension the summation in the BCH formula has a closed form (Eng0 2001) as a.A+^B+yiAxB). 

Spin group 
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We again take special interest in the 3x3 case, where [A^] equals the cross product, AxB. If A and B are linearly 
independent, then A, B, and AxB can be used as a basis; if not, then A and B commute. And conveniently, in this 
dimension the summation in the BCH formula has a closed form (Eng0 2001) as aA+^B+y(AxB). 

Spin group 

The Lie group of nxn rotation matrices, SO(n), is a compact and path -connected manifold, and thus locally 
compact and connected. However, it is not simply connected, so Lie theory tells us it is a kind of "shadow" (a 
homomorphic image) of a universal covering group. Often the covering group, which in this case is the spin group 
denoted by Spin(n), is simpler and more natural to work with (Baker 2003, Ch. 5; Fulton & Harris 1991, pp. 299- 
315). 

In the case of planar rotations, SO(2) is topologically a circle, . Its universal covering group, Spin(2), is 
isomorphic to the real line, R, under addition. In other words, whenever we use angles of arbitrary magnitude, 
which we often do, we are essentially taking advantage of the convenience of the "mother space". Every 2x2 
rotation matrix is produced by a countable infinity of angles, separated by integer multiples of 2jt. Correspondingly, 
the fundamental group of S0(2) is isomorphic to the integers, Z. 

In the case of spatial rotations, S0(3) is topologically equivalent to three-dimensional real projective space, RP-'. Its 
universal covering group, Spin(3), is isomorphic to the 3-sphere, S^. Every 3x3 rotation matrix is produced by two 
opposite points on the sphere. Correspondingly, the fundamental group of SO(2) is isomorphic to the two-element 
group, Z2. We can also describe Spin(3) as isomorphic to quaternions of unit norm under multiplication, or to 
certain 4x4 real matrices, or to 2x2 complex special unitary matrices. 

Concretely, a unit quaternion, q, with 
produces the rotation matrix 

This is our third version of this matrix, here as a rotation around non-unit axis vector (x,y^) by angle 26, where 
cos Q = w and Isin 91 = ll(jc,}',z)ll. (The proper sign for sin 0 is implied once the signs of the axis components are 
decided.) 

Many features of this case are the same for higher dimensions. The coverings are all two-to-one, with SO(n), n > 2, 
having fundamental group Z2. The natural setting for these groups is within a Clifford algebra. And the action of 

the rotations is produced by a kind of "sandwich", denoted by qvq*. 

Infinitesimal rotations 

The matrices in the Lie algebra are not themselves rotations; the skew- symmetric matrices are derivatives, 
proportional differences of rotations. An actual "differential rotation", or infinitesimal rotation matrix has the form 

where dQ is vanishingly small. These matrices do not satisfy all the same properties as ordinary finite rotation 

matrippc iinHfr thf» 11G119I treatment nf infinitf»cimaIo (drtlriQfMn Pnr»lp Xr QnfVn 9009 S4 R\ Tr\ iinHprctnnH what 
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this means, consider 

We first test the orthogonality condition, Q^Q = I. The product is 

differing from an identity matrix by second order infinitesimals, which we discard. So to first order, an infinitesimal 
rotation matrix is an orthogonal matrix. Next we examine the square of the matrix. 

Again discarding second order effects, we see that the angle simply doubles. This hints at the most essential 
difference in behavior, which we can exhibit with the assistance of a second infinitesimal rotation. 

Compare the products dA^dAy and dAydA^^. 

Since dQ d^ is second order, we discard it; thus, to first order, multiplication of infinitesimal rotation matrices is 
commutative. In fact, 

again to first order. Put in other words, the order in which infinitesimal rotations are applied is irrelevant, this 
useful fact makes, for example, derivation of rigid body rotation relatively simple. 

But we must always be careful to distinguish (the first order treatment of) these infinitesimal rotation matrices from 
both finite rotation matrices and from derivatives of rotation matrices (namely skew-symmetric matrices). Control 
the behavior of finite rotation matrices in the BCH formula with that of infinitesimal rotation matrices, where all the 
commutator terms will be second order infinitesimals so we do have a vector space. 

Conversions 

Main article: Rotation representation (mathematics)#Conversion formulae between representations 

We have seen the existence of several decompositions that apply in any dimension, namely independent planes, 
sequential angles, and nested dimensions. In all these cases we can either decompose a matrix or construct one. We 
have also given special attention to 3x3 rotation matrices, and these warrant further attention, in both directions 
(Stuelpnagel 1964). 

Quaternion 

Main article: Quaternions and spatial rotation 
Given the unit quaternion q= (w^,yz), the equivalent 3x3 rotation matrix is 

Now every quaternion component appears multiplied by two in a term of degree two, and if all such terms are zero 
what's left is an identity matrix. This leads to an efficient, robust conversion from any quaternion — whether unit. 



http://en.wikipedia.org/wiki/Rotation_matrix 



Page 17 of 24 



Rotation matrix - Wil<ipedia, the free encyclopedia 

nonunit, or even zero — to a 3x3 rotation matrix. 



3/16/10 10:08 AM 



Iwq = w'2 + K"! + y"2 + z*2 

jif Nq > 0.0 then s = 2/Nq else s = 0.0 

;X = x*s; Y = y*s; Z = z*S 

yx = w*X; wY = w*Y; wZ = w*Z 

IxX = x*X; xY = x*Y; xZ = x*Z 

lyY = y*Y; yZ = y*Z; zZ = z*Z 

l[ 1.0-(yY+zZ) xY-wZ xZ+wY ] 

|[ xY+wZ 1.0-(xX+2Z) yZ-wX ] 

;[ xZ-wY yZ+wX 1.0-(xX+yY) ] 



Freed from the demand for a unit quaternion, we find that nonzero quaternions act as homogeneous coordinates for 
3x3 rotation matrices. The Cayley transform, discussed earUer, is obtained by scaling the quaternion so that its w 
component is 1. For a 180° rotation around any axis, w will be zero, which explains the Cayley limitation. 

The sum of the entries along the main diagonal (the trace), plus one, equals A-Aix^+y^+t^), which is 4w^. Thus we 
can write the trace itself as 2w^+2w^-l; and from the previous version of the matrix we see that the diagonal 
entries themselves have the same form: Ix^+lw^-l, ly^+lw^-l, and 2z^+2vi;^-l. So we can easily compare the 
magnitudes of all four quaternion components using the matrix diagonal. We can, in fact, obtain all four magnitudes 
using sums and square roots, and choose consistent signs using the skew-symmetric part of the off-diagonal entries. 



|W = 0.5*sqrt(l+Qxx+Qyy+Qzz) 

jx = copysign(0.5*sqrt(l+Q3jx-Qyy-Qzz) 'Qzy-Qyz) 
jy = copysign(0.5*sqrt(l-Qj,j,+Qyy-Q22) ,Qxz-Qzx) 
iz = copysign(0.5*sqrt(l-Qxx-Qyy+Qzz) 'Qyx-Qxy) 



where copysign(x,y) is x with the sign of y: 



Alternatively, use a single square root and division 



I 

•t = Qxx+Qyy+Qzz 

■r = sqrt(l+t) 

;s = 0.5/r 

y = 0.5*r 

!x = (Q2y-Qy2)*s 

iy = (Qxz-Qzx)*s 
:z = (Qyx-Qxy)*s 



This is numerically stable so long as the trace, t, is not negative; otherwise, we risk dividing by (nearly) zero. In 
that case, suppose Q^x is the largest diagonal entry, so x will have the largest magnitude (the other cases are 
similar); then the following is safe. 



jr = sqrt(l+Qxx-Qyy-Q22) 

;s = 0.5/r 

•w = (Q2y-QyJ*s 

Ix = 0.5*r 

iy = (Qxy+Qyx)*s 

;z = (Qzx+Qxz)*s 
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If the matrix contains significant error, such as accumulated numerical error, we may construct a symmetric 4x4 
matrix, 



and find the eigenvector, (w,k^z), of its largest magnitude eigenvalue. (If Q is truly a rotation matrix, that value 
will be 1 .) The quaternion so obtained will correspond to the rotation matrix closest to the given matrix (Bar- 
Itzhack 2000). 

Polar decomposition 

If the nxn matrix Mis non-singular, its columns are linearly independent vectors; thus the Gram-Schmidt process 
can adjust them to be an orthonormal basis. Stated in terms of numerical linear algebra, we convert M to an 
orthogonal matrix, Q, using QR decomposition. However, we often prefer a Q "closest" to M, which this method 
does not accomplish. For that, the tool we want is the polar decomposition (Fan & Hoffman 1955; Higham 1989). 

To measure closeness, we may use any matrix norm invariant under orthogonal transformations. A convenient 
choice is the Frobenius norm, llg-Mllp, squared, which is the sum of the squares of the element differences. 
Writing this in terms of the trace, Tr, our goal is, 

■ Find Q minimizing Tr( (Q-M)^{Q-M) ), subject to Q^Q = I. 

Though written in matrix terms, the objective function is just a quadratic polynomial. We can minimize it in the 
usual way, by finding where its derivative is zero. For a 3x3 matrix, the orthogonality constraint implies six scalar 
equalities that the entries of Q must satisfy. To incorporate the constraint(s), we may employ a standard technique, 
Lagrange multipliers, assembled as a symmetric matrix, Y. Thus our method is: 

■ Differentiate Tr( {Q-M)^{Q-M) + (Q^Q-r)Y) with respect to (the entries of) Q, and equate to zero. 



Consider a 2x2 example. Including constrmnts, we seek to minimize 

Talcing the derivative with respect to !2xx» Cxy Cyx' Qyy turn, we assemble a matrix. 



In general, we obtain the equation 
so that 



where Q is orthogonal and S is 

symmetric. To ensure a minimum, the F matrix (and hence S) must be positive definite. Linear algebra calls QS the 
polar decomposition of M, with S the positive square root of = M^M. 



When M is non-singular, the Q and S factors of the polar decomposition are uniquely determined. However, the 
determinant of 5* is positive because S is positive definite, so Q inherits the sign of the determinant of M. That is, Q 
is only guaranteed to be orthogonal, not a rotation matrix. This is unavoidable; an M with negative determinant has 
no uniquely-defined closest rotation matrix. 

Axis and angle 

To efficiently construct a rotation matrix from an angle 9 and a unit axis u, we can take advantage of symmetry and 
skew-svmmetrv within the entries. 
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Axis and angle 

To efficiently construct a rotation matrix from an angle 8 and a unit axis u, we can take advantage of symmetry and 
skew-symmetry within the entries. 



|c = cos(6); s = sin(6); C = 1-c I 

!xs = x*s; ys = y*s; zs = z*s I 

IxC = x*C; yC = y*C; zC = z*C I 

'xyC = x*yC; yzC = y*zC; zxC = z*xC j 

|[ x*xC+c xyC-zs zxC+ys ] ] 

\l xyC+zs y*yC+c yzC-xs ] 1 

',[ zxC-ys yzC+xs z*zC+o ] ! 

L I 

Determining an axis and angle, like determining a quaternion, is only possible up to sign; that is, (u,G) and (-u,-0) 
correspond to the same rotation matrix, just like q and -q. As well, axis-angle extraction presents additional 
difficulties. The angle can be restricted to be from 0° to 180°, but angles are formally ambiguous by multiples of 
360°. When the angle is zero, the axis is undefined. When the angle is 180°, the matrix becomes symmetric, which 
has implications in extracting the axis. Near multiples of 180°, care is needed to avoid numerical problems: in 
extracting the angle, a two-argument arctangent with atan2 (sin 0 , cos 0 ) equal to 0 avoids the insensitivity 
of arccosine; and in computing the axis magnitude to force unit magnitude, a brute-force approach can lose 
accuracy through underflow (Moler & Morrison 1983). 

A partial approach is as follows. 



I 

X = Qzy-Qyz 

'f = Qyx-Qxy 

Ir = hypot(x,hypot(y, z) ) 

it = Qxx+Qyy+Qzz 

;e = atan2(r,t-l) 



The x,y, and z components of the axis would then be divided by r . A fully robust approach will use different code 
when t is negative, as with quaternion extraction. When r is zero because the angle is zero, an axis must be 
provided from some source other than the matrix. 

Euler angles 

Complexity of conversion escalates with Euler angles (used here in the broad sense). The first difficulty is to 
establish which of the twenty-four variations of Cartesian axis order we will use. Suppose the three angles are Gj , 
02, 03; physics and chemistry may interpret these as 

while aircraft dynamics may use 

One systematic approach begins with choosing the right-most axis. Among all permutations of {x,y^), only two 
place that axis first; one is an even permutation and the other odd. Choosing parity thus establishes the middle axis. 
That leaves two choices for the left-most axis, either duplicating the first or not. These three choices gives us 
3x2x2 =12 variations; we double that to 24 by choosing static or rotating axes. 
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One systematic approach begins with choosing the right-most axis. Among all permutations of {x,yz), only two 
place that axis first; one is an even permutation and the other odd. Choosing parity thus establishes the middle axis. 
That leaves two choices for the left-most axis, either duplicating the first or not. These three choices gives us 
3x2x2 = 12 variations; we double that to 24 by choosing static or rotating axes. 

This is enough to construct a matrix from angles, but triples differing in many ways can give the same rotation 
matrix. For example, suppose we use the zyz convention above; then we have the following equivalent pairs: 

(90°, 45°, -105°) = (-270°, -315°, 255°) multiples of 360° 
(72°, 0°, 0°)= (40°, 0°, 32°) singular alignment 
(45°, 60°, -30°) s (-135°, -60°, 150°) bistable flip 

Angles for any order can be found using a concise common routine (Herter & Lott 1993; Shoemake 1994). 

The problem of singular alignment, the mathematical analog of physical gimbal lock, occurs when the middle 
rotation aligns the axes of the first and last rotations. It afflicts every axis order at either even or odd multiples of 
90°. These singularities are not characteristic of the rotation matrix as such, and only occur with the usage of Euler 
angles. 

The singularities are avoided when considering and manipulating the rotation matrix as orthonormal row vectors (in 
3D applications often named 'right'- vector, 'up'- vector and 'out'-vector) instead of as angles. The singularities are 
also avoided when working with quaternions. 

Uniform random rotation matrices 

We sometimes need to generate a uniformly distributed random rotation matrix. It seems intuitively clear in two 
dimensions that this means the rotation angle is uniformly distributed between 0 and 2k. That intuition is correct, 
but does not carry over to higher dimensions. For example, if we decompose 3x3 rotation matrices in axis-angle 
form, the angle should not be uniformly distributed; the probability that (the magnitude of) the angle is at most 0 
should be Vjt(e - sin 0), for 0 9 < 3i. 

Since SO(n) is a connected and locally compact Lie group, we have a simple standard criterion for uniformity, 
namely that the distribution be unchanged when composed with any arbitrary rotation (a Lie group "translation"). 
This definition corresponds to what is called Haar measure. Leon, Masse & Rivest (2006) show how to use the 
Cayley transform to generate and test matrices according to this criterion. 

We can also generate a uniform distribution in any dimension using the subgroup algorithm of Diaconis & 
Shashahani (1987). This recursively exploits the nested dimensions group structure of SO(n), as follows. Generate a 
uniform angle and construct a 2x2 rotation matrix. To step from n to n+1 , generate a vector v uniformly distributed 
on the /T-sphere, 5", embed the nxn matrix in the next larger size with last column (0,...,0,1), and rotate the larger 
matrix so the last column becomes v. 

As usual, we have special alternatives for the 3x3 case. Each of these methods begins with three independent 
random scalars uniformly distributed on the unit interval. Arvo (1992) takes advantage of the odd dimension to 
change a Householder reflection to a rotation by negation, and uses that to aim the axis of a uniform planar rotation. 
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Another method uses unit quaternions. Multiplication of rotation matrices is homomorphic to multiplication of 
quaternions, and multiplication by a unit quaternion rotates the unit sphere. Since the homomorphism is a local 
isometry, we immediately conclude that to produce a uniform distribution on S0(3) we may use a uniform 

distribution on S'^. 

Euler angles can also be used, though not with each angle uniformly distributed (Murnaghan 1962; Miles 1965). 

For the axis-angle form, the axis is uniformly distributed over the unit sphere of directions, 5^, while the angle has 
the non-uniform distribution over [0,Jt] noted previously (Miles 1965). 

See also 

■ Rotation representation 

■ Isometry 

■ Orthogonal matrix 

■ Rodrigues' rotation formula 

■ Yaw -pitch- roll system 

■ Plane of rotation 

Notes 

1 . W3C recommendation (2003). "Scalable Vector Graphics ~ the initial coordinate system" 
(http://www.w3.Org/TR/SVG/coords.html#InitialCoordinateSystem) . 

http://www .w3 .org/TR/SVG/coords .html#InitialCoordinateSystem . 

2. Murray, Glenn (2005). "Rotation About an Arbitrary Axis in 3 Dimensions" 
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Permutation matrix 

From Wikipedia, the free encyclopedia 

In mathematics, in matrix theory, a permutation matrix is 

a square binary matrix that has exactly one entry 1 in each 
row and each column and O's elsewhere. Each such matrix 
represents a specific permutation of m elements and, when 
used to multiply another matrix, can produce that 
permutation in the rows or columns of the other matrix. 
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Definition 

Given a permutation 3t of m elements, 

w : {1, . . . , m} ^ {1, . . . , m} 
given in two-line form by 

1 2 • m \ 
7r(l) 7r(2) ^(m)j' 

its permutation matrix is the m x m matrix whose entries are all 0 except that in row /, the entry Jt(/) equals 
1 . We may write 
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Matrices describing the permutations of 3 

elements. i 

The product of two permutation matrices is a | 

permutation matrix as well ^ 
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P = 



ejr(l) 
®7r(m) 



where Gj denotes a row vector of length m with 1 in the 7th position and 0 in every other position. 

Properties 

Given two permutations k and o of m elements and the corresponding permutation matrices Pj^ and 
p p — p 



This somewhat unfortunate rule is a consequence of the definitions of multiplication of permutations 
(composition of bijections) and of matrices, and of the choice of using the vectors ©^(i) as rows of the 
permutation matrix; if one had used columns instead then the product above would have been equal to 
with the permutations in their original order. 



As permutation matrices are orthogonal matrices (i.e., P^Pj = /)> the inverse matrix exists and can be 
written as 

Multiplying Pj^ times a column vector g will permute the rows of the vector: 







51 




9Mt) 






92 




9^m 


» 




9n^ 




9iir{n)^ 



Now applying after applying Pj^ gives the same result as applying directly, in accordance with the 
above multiplication rule: call p^g = g', in other words 

9i = 9Ai) 
for all I, then 
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Multiplying a row vector h times P^r will permute the columns of the vector by the inverse of Fjr: 



hP^ = [hi h2 . . . hn 



®7r(n) 



Again it can be checked that (hFff)P^ = h-P^roa- 

Notes 

Let 5„ denote the symmetric group, or group of permutations, on {1 ,2,...,n}. Since there are n\ permutations, 
there are n\ permutation matrices. By the formulas above, the w x n permutation matrices form a group under 
matrix multiplication with the identity matrix as the identity element. 

If (1) denotes the identity permutation, then P^^^ is the identity matrix. 

One can view the permutation matrix of a permutation a as the permutation a of the columns of the identity 
matrix /, or as the permutation of the rows of /. 

A permutation matrix is a doubly stochastic matrix. The Birkhoff-von Neumann theorem says that every 
doubly stochastic matrix is a convex combination of permutation matrices of the same order and the 
permutation matrices are the extreme points of the set of doubly stochastic matrices. That is, the Birkhoff 
polytope, the set of doubly stochastic matrices, is the convex hull of the set of permutation matrices. 

The product PM, premultiplying a matrix M by a permutation matrix P, permutes the rows of M; row / moves 
to row T^i). Likewise, MP permutes the columns of M. 

The map 5"^ -> A C GL(n, Z2) is a faithful representation. Thus, lAI = n!. 

The trace of a permutation matrix is the number of fixed points of the permutation. If the permutation has 
fixed points, so it can be written in cycle form as Jt = (ai)(a2)...(a/)o where a has no fixed points, then 
^ai»^a2'"*'*«* are eigenvectors of the permutation matrix. 

From group theory we know that any permutation may be written as a product of transpositions. Therefore, 
any permutation matrix P factors as a product of row-interchanging elementary matrices, each having 
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determinant -1 . Thus the determinant of a permutation matrix P is just the signature of the corresponding 
permutation. 



Examples 

The permutation matrix Fjj corresponding to the permutation r^r 



2 3 4 5 
4 2 5 3 
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Given a vector g, 
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Solving for P 

If we are given two matrices A and B which are known to be related as B = PAP ~ ^ , but the permutation 
matrix P itself is unknown, we can find P using eigenvalue decomposition: 

A = QAm:^ 

B = Qb^Qb^ 

where A is a diagonal matrix of eigenvalues, and Qj^ and are the matrices of eigenvectors. The 
eigenvalues of A and B will always be the same, and P can be computed as p = Q^Q^- In other words, 
PQj^ — Qb, which means that the eigenvectors of B are simply permuted eigenvectors of A. 

[ Note .The above processing of solving P is wrong. The right statement should be: there exist eigenvalue 
decomposition :^4 = Q^AQ^^ 

B = Qb^Qb^^ ^^"^^ P = QbQ^a ^ P^^'n^utation solution of 5 = PAP ~ ^ ] 
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Example 

Given the two matrices 



A = 



B 



0 1 2 

1 0 1.5 

2 1.5 0 

1 1.5 

1 0 2 

1.5 2 0 



and the transformation matrix that changes A into B is 



P = 



0 1 0' 

1 0 0 
0 0 1 



which says that the first & second row as well as the first & second column of A have been swapped to yield 
(and visual inspection confirms this). 

After finding the eigenvalues of both A and B and diagonalizing them into a diagonal matrix is 



A = 



2.09394 0 0 
0 0.9433954 0 
0 0 3.037337 



and the Qy^ matrix of eigenvectors for A is 



-0.60130 0.54493 0.58437 
-0.25523 ^0.82404 0.50579 
0.75716 0.15498 0.63458 



and the Q ^ matrix of eigenvectors for B is 



Qi 



-0.25523 "-0.82404 
^0.60130 0.54493 
0.75716 0.15498 



-0.50579 

-0.58437 
-0.63458 
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Comparing the first eigenvector (i.e., the first column) of both we can write the first column of P by noting 
that the first element (Ga(1 ,1) ~ ~ 0.60130) matches the second element (2^(2,1))' ^^"^ ^® P^* ^ ^ *® 
second element of the first column of P. Repeating this procedure, we match the second element (Ga(2,1)) ^ 
the first element (2j5(l,l))' thus we put a 1 in the first element of the second column of P; and the third 
element (<2a(3,1)) the third element (25(3,1))» thus we put a 1 in the third element of the third column of 
P. 

The resulting P matrix is: 





0 


1 


0" 




1 


0 


0 




0 


0 


1 



And comparing to the P matrix from above, we find they are the same. 

Explanation 

A permutation matrix will always be in the form 

where e^. represents the ith basis vector (as a row) for R/, and where 
"1 2 ... j" 

is the permutation form of the permutation matrix. 

Now, in performing matrix multiplication, one essentially forms the dot product of each row of the first matrix 
with each column of the second. In this instance, we will be forming the dot product of each column of this 
matrix with the vector with elements we want to permute. That is, for example, = (80,-^5) , 
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So, the product of the permutation matrix with the vector v above, will be a vector in the form (goj, ga2' 
g^p, and that this then is a permutation of v since we have said that the permutation form is 

/I 2 ... j\ 

So, permutation matrices do indeed permute the order of elements in vectors multiplied with them. 

Matrices with constant line sums 

The sum of the values in each column or row in a permutation matrix adds up to exactly 1 . A possible 
generalization of permutation matrices is nonnegative integral matrices where the values of each column and 
row add up to a constant number c. A matrix of this sort is known to be the sum of c permutation matrices. 

For example in the following matrix M each column or row adds up to 5. 
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3 



This matrix is the sum of 5 permutation matrices. 

See also 

■ Alternating sign matrix 

■ Generalized permutation matrix 
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